CDS
Accession Number | TCMCG075C25930 |
gbkey | CDS |
Protein Id | XP_017983039.1 |
Location | complement(join(4652740..4652850,4653499..4653826,4653901..4654106,4654193..4654303,4654688..4654798,4654906..4655015,4655114..4655223,4655335..4655510,4655800..4655964,4656079..4656197,4656501..4656588,4656707..4656812,4657009..4657097,4657664..4657807,4657961..4658053,4658176..4658242,4658374..4658486,4658593..4658688,4658842..4659018)) |
Gene | LOC18588401 |
GeneID | 18588401 |
Organism | Theobroma cacao |
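The Location field can be checked against the protein length reported in this record. The following is a minimal sketch, assuming a plain `complement(join(start..end,...))` string with 1-based inclusive coordinates and no fuzzy (`<`, `>`) or single-base positions. The helper names `parse_location` and `cds_length` are hypothetical and not part of any library.

```python
import re

# Location string copied from the record's "Location" field.
LOCATION = (
    "complement(join(4652740..4652850,4653499..4653826,4653901..4654106,"
    "4654193..4654303,4654688..4654798,4654906..4655015,4655114..4655223,"
    "4655335..4655510,4655800..4655964,4656079..4656197,4656501..4656588,"
    "4656707..4656812,4657009..4657097,4657664..4657807,4657961..4658053,"
    "4658176..4658242,4658374..4658486,4658593..4658688,4658842..4659018))"
)


def parse_location(location):
    """Return (strand, [(start, end), ...]) for a simple location string.

    Hypothetical helper: only handles plain start..end segments.
    """
    strand = "-" if location.startswith("complement(") else "+"
    segments = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]
    return strand, segments


def cds_length(segments):
    """Sum segment lengths; coordinates are 1-based and inclusive."""
    return sum(end - start + 1 for start, end in segments)


strand, segments = parse_location(LOCATION)
total = cds_length(segments)

print(strand, len(segments), total)  # - 19 2520
print(total // 3 - 1)                # 839 codons once the stop codon is excluded
```

The 19 segments sum to 2520 bases, which is 840 codons: 839 residues plus one stop codon, matching the 839aa length given in the Protein section.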
Protein
Length | 839aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_018127550.1 |
Definition | PREDICTED: beta-galactosidase [Theobroma cacao] |
EGGNOG-MAPPER Annotation
COG_category | G |
Description | beta-galactosidase |
KEGG_TC | - |
KEGG_Module | - |
KEGG_Reaction | - |
KEGG_rclass | - |
BRITE | - |
KEGG_ko | - |
EC | - |
KEGG_Pathway | - |
GOs | GO:0003674, GO:0003824, GO:0004553, GO:0004565, GO:0005575, GO:0005618, GO:0005622, GO:0005623, GO:0005737, GO:0005773, GO:0015925, GO:0016787, GO:0016798, GO:0030312, GO:0043226, GO:0043227, GO:0043229, GO:0043231, GO:0044424, GO:0044444, GO:0044464, GO:0071944 |
Sequence
CDS: ATGTGGAACAGAGACATGTTGTCAAGGGTCACCGTGTTCATGTTATGGCTATTGTTTTCTTCTTGGGTTTTTTCAGTTTCAGCTACTGTTTCTTATGACAGTAAAGCTATCATCATTAATGGCAGGAGAAGGATTCTTCTTTCTGGCTCCATTCATTACCCCAGAAGCACTCCGCAGATGTGGCCTGATCTTATAGCAAAGGCTAAAGAAGGAGGCTTGGATGTTATACAAACTTATGTTTTCTGGAACGGACACGAGCCTTCTCCTGGAAAATATTATTTTGACGATAGGTATGATCTGGTTCGATTTATTAAGCTGGTGCAACAGGCTGGACTTTATGTTCATCTCCGGATTGGTCCCTATGTTTGTGCTGAATGGAACTTTGGGGGATTTCCTGTGTGGCTGAAATATGTCCCCGGCATTGTTTTCAGGACAGACAATGGACCTTTCAAGGCTGCAATGCAAAAATTCACAGAGAAGATAGTCAGCATGATGAAAGCAGAAAAGCTGTTTCAGACTCAAGGAGGTCCAATAATTATGTCTCAGATTGAAAATGAATTTGGTCCTGTTGAATGGGAAATTGGTGCTCCAGGTAAAGCTTACACCAAATGGGCTGCACAAATGGCAGTGGGACTTGGCACTGGAGTCCCATGGATTATGTGCAAGCAAGATGATGCTCCTGACCCTGTGATAAACACCTGCAATGGATTCTACTGTGAAAATTTTACTCCCAACGCGAAATACAAACCAAAGATGTGGACAGAGAACTGGACTGGCTGGTTTACAGAGTTTGGTGGTGCTGTCCCTACCAGACCTGCAGAAGACATAGCATTTTCAGTTGCACGATTCATTCAGAATGGTGGTTCATTTGTTAATTATTATATGTACCATGGAGGAACCAATTTTGGGCGGACAGCTGGTGGTCCCTTCATTGCTACCAGCTATGACTATGATGCTCCTATTGATGAATATGGGCTACCAAGGGAACCAAAATGGGGACATCTGAGAGATTTGCATAAAGCCATCAAATTAAGTGAACCAGCTTTAGTTTCTGCAGATCCTACCGTGACTTCACTTGGAAGTAATCAGGAGGCTCACGTATTCAAGGCAAAGTCTGGTGCATGTGCTGCATTCCTTGCAAACTATGACACAAAATACTCTGTAAAAGTAACTTTCGGAAATGTGCAATATGACTTACCAGCTTGGTCCATCAGCATCCTTCCCGACTGTAAAACTGCTGTTTTCAACACTGCCAGGCTTGGTGCCCAAAGCTCACAAAAGAAGATGGAAACTGTAAACAGCGCATTCTCTTGGCAATCATATAATGAAGAAAGCCCCTCTGCTGATGATCAGGATGCAACTGTAAAAGACGGGCTCTTGGAACAGATATATGTCACCAGAGATGCTTCAGATTATTTGTGGTACATGACAGATGTACAAATAGATCCTAATGAAGGATTTTTGACAAGTGGACAAGATCCTTCTCTGACCATTTGGTCAGCAGGTCATGCTTTGCATGTTTTCATTAATGGTCAATTATCCGGGACTGCGTATGGGGAATTGGACAATCCAAAATTAACATTCAGCAAAAATGTCAAACTACGAGCTGGGATTAACAAGATTTCTTTATTAAGCATTGCAGTGGGACTTCCAAATGTTGGCGTTCATTTTGAGACATGGAATGCTGGGGTTCTAGGTCCTGTTACATTGAAGGGTCTCAATGAGGGGTCAAGAGACTTATCTAAGCAGAAATGGTCTTACAAGATTGGTCTAAAAGGGGAGGCCTTAAGCCTTCATACCGTTACTGGAAGCTCCTCTGTTGAATGGGTCAAAGGATCGCTATTGGTAAAGAAACAACCTATGACTTGGTACAAGACAACTTTTAATGCACCGGGTGGCAATGAACCATTGGCTTTAGATATGAGTAGCATGGGAAAAGGGCAAATATGGATAAATGGCCAGAGCATTGGACGCCACTGGCCTGGATATATAGCA
CGTGGTGCGTGTGGTGCTTGTGATTATGCTGGAACTTATAGTGATAAGAAATGCCGAACTAATTGTGGAGAGCCGTCTCAAAGATGGTACCATGTTCCACGCTCATGGCTGAACCCAAGTGGAAACCTCATGGTTGTGTTTGAAGAATGGGGTGGTGATCCATCTGGAATTTCTTTGGTCAAAAGAACAACCGGAAGTGTTTGTGCTGATATTTTTGAAGCGCAACCAACAATGAAGAATTGGGGAATGCTAGCTTCTGGCAAAATCAATCGACCCAAAGCCCATTTGTGGTGTCCTCCTGGGCAGAAAATTTCTGAAATAAAGTTTGCTAGTTATGGAATGCCCGAGGGGACTTGTGGAAGCTTTAGTGAGGGAAGCTGCCATGCCCACAGGTCATATGATGCGTTTCAAAAGAATTGCATTGGAAAACAATCATGTTCGGTAACTGTGGCTCCAGAAGTTTTTGGAGGAGATCCATGTCCAGATAGCATGAAGAAGCTCTCAGTTGAAGCTGCCTGCAACTGA |
Protein: MWNRDMLSRVTVFMLWLLFSSWVFSVSATVSYDSKAIIINGRRRILLSGSIHYPRSTPQMWPDLIAKAKEGGLDVIQTYVFWNGHEPSPGKYYFDDRYDLVRFIKLVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIVFRTDNGPFKAAMQKFTEKIVSMMKAEKLFQTQGGPIIMSQIENEFGPVEWEIGAPGKAYTKWAAQMAVGLGTGVPWIMCKQDDAPDPVINTCNGFYCENFTPNAKYKPKMWTENWTGWFTEFGGAVPTRPAEDIAFSVARFIQNGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLPREPKWGHLRDLHKAIKLSEPALVSADPTVTSLGSNQEAHVFKAKSGACAAFLANYDTKYSVKVTFGNVQYDLPAWSISILPDCKTAVFNTARLGAQSSQKKMETVNSAFSWQSYNEESPSADDQDATVKDGLLEQIYVTRDASDYLWYMTDVQIDPNEGFLTSGQDPSLTIWSAGHALHVFINGQLSGTAYGELDNPKLTFSKNVKLRAGINKISLLSIAVGLPNVGVHFETWNAGVLGPVTLKGLNEGSRDLSKQKWSYKIGLKGEALSLHTVTGSSSVEWVKGSLLVKKQPMTWYKTTFNAPGGNEPLALDMSSMGKGQIWINGQSIGRHWPGYIARGACGACDYAGTYSDKKCRTNCGEPSQRWYHVPRSWLNPSGNLMVVFEEWGGDPSGISLVKRTTGSVCADIFEAQPTMKNWGMLASGKINRPKAHLWCPPGQKISEIKFASYGMPEGTCGSFSEGSCHAHRSYDAFQKNCIGKQSCSVTVAPEVFGGDPCPDSMKKLSVEAACN |
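The relationship between the CDS and Protein fields can be illustrated with a small translation routine. This is a sketch under stated assumptions: it uses the standard nuclear genetic code, the function name `translate` is hypothetical, and only short fragments copied from the two ends of the CDS field are translated here rather than the full sequence. Whitespace is stripped first so that a sequence wrapped over several lines is read as one string.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon.

    Hypothetical helper: assumes an unambiguous A/C/G/T sequence in frame.
    """
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 18 bases and last 12 bases of the CDS field in this record.
print(translate("ATGTGGAACAGAGACATG"))  # MWNRDM
print(translate("GCCTGCAACTGA"))        # ACN
```

The two fragments give `MWNRDM` and `ACN`, which agree with the first six and last three residues of the Protein field; the terminal `TGA` is the stop codon and contributes no residue.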